Microsoft Word - A novel super-wideband embedded speech and audio c

نویسندگان

  • Mao-shen JIA
  • Chang-chun BAO
  • Xin LIU
  • Rui LI
چکیده

This paper proposes a multi-layer super-wideband embedded speech and audio coding algorithm extending bit rates from 36 to 64 kb/s on the basis of ITU-T Recommendation G.729.1 with a multi-stage coding structure. This codec consists of three embedded stages: G.729.1 wideband coding operating in the range from 8 to 32 kb/s, modified Modulated Lapped Transform (MLT) coding of the band (7-14 kHz) at 36, 40 & 48 kb/s and MDCT transform coding for wideband residual signal at 56 and 64 kb/s. In addition, some methods are proposed in transform coding according to perception significance. The objective and subjective listening tests show that this codec has good performance compared with reference codec.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

System for Speech Transcription and Post-Editing in Microsoft Word

In this demonstration paper, we introduce a transcription service that can be used for transcription of different meetings, sessions etc. The service performs speaker diarization, automatic speech recognition, punctuation restoration and produces human-readable transcripts as special Microsoft Word documents that have audio and word alignments embedded. Thereby, a widely-used word processor is ...

متن کامل

Mandarin-English Information (MEI)

Mandarin-English Information (MEI) is one of the four projects selected for the Johns Hopkins University Summer Workshop 2000. We plan to develop technologies for using written queries to search spoken documents (cross-media) between English and Mandarin Chinese (cross-language). Our research focus is on the integration of speech recognition and machine translation technologies in the context o...

متن کامل

Audio bandwidth extension using ensemble of recurrent neural networks

In audio communication systems, the perceptual audio quality of the reproduced audio signals such as the naturalness of the sound is limited by the available audio bandwidth. In this paper, a wideband to super-wideband audio bandwidth extension method is proposed using an ensemble of recurrent neural networks. The feature space of wideband audio is firstly divided into different regions through...

متن کامل

An audio watermark-based speech bandwidth extension method

A novel speech bandwidth extension method based on audio watermark is presented in this paper. The time-domain and frequency-domain envelope parameters are extracted from the high-frequency components of speech signal, and then these parameters are embedded in the corresponding narrowband speech bit stream by the modified least significant bit watermark method which uses perception property. At...

متن کامل

Super-Wideband Bandwidth Extension for Wideband Audio Codecs Using Switched Spectral Replication and Pitch Synthesis

This paper describes a new bandwidth extension algorithm which is targeted at high quality audio communication over IP networks. The algorithm is part of the Huawei/ETRI candidate for the ITU-T super-wideband (SWB) extensions of Rec. G.729.1 and G.718. In the SWB candidate codec, the 7-14 kHz frequency band of speech and audio signals is represented in terms of temporal and spectral envelopes. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009